Exploring the Utility of the Random Forest Method for Forecasting Ozone Pollution in SYDNEY
نویسندگان
چکیده
This paper explores the utility of an ensemble decision-tree method called random forest, in comparison with the classic classification and regression trees (CART) algorithm, for forecasting ground-level ozone pollution in the Sydney metropolitan region. Statistical forecasting models are developed to provide daily ozone forecasts in November-March for three subregions, i.e., Sydney east, Sydney south-west and Sydney north-west. The random forest models are evaluated in reference to the single decision-tree models developed from the classic CART algorithm. The results show that the random forest models outperform the CART models for forecasting high ozone pollution in Sydney south-west and Sydney north-west, the areas where the highest ozone pollution are observed. The random forest models also show a lift in forecasting skills in Sydney south-west if compared to the existing forecasting practice for the basin as a whole. These results suggest that random forest is a promising method for air quality forecasting in Sydney. This study promotes the application of a statistical ensemble approach to air quality forecasting.
منابع مشابه
Forecasting Ozone Density in Tehran Air Using a Smart Data-Driven Approach
Introduction: As a metropolitan area in Iran, Tehran is exposed to damage from air pollution due to its large population and pollutants from various sources. Accordingly, research on damage induced by air pollution in this city seems necessary. The main purpose of this study was to forecast ozone in the city of Tehran. Considering the hazards of ozone (O3) gas on human health and the environmen...
متن کاملForecasting Stock Trend by Data Mining Algorithm
Stock trend forecasting is a one of the main factors in choosing the best investment, hence prediction and comparison of different firms’ stock trend is one method for improving investment process. Stockholders need information for forecasting firm’s stock trend in order to make decision about firms’ stock trading. In this study stock trend, forecasting performs by data mining algorithm. It sho...
متن کاملTown trip forecasting based on data mining techniques
In this paper, a data mining approach is proposed for duration prediction of the town trips (travel time) in New York City. In this regard, at first, two novel approaches, including a mathematical and a statistical approach, are proposed for grouping categorical variables with a huge number of levels. The proposed approaches work based on the cost matrix generated by repetitive post-hoc tests f...
متن کاملOzone Peak and Pollution Forecasting Using Support Vectors
This paper investigates the efficiency of Support Vector Machines for ozone peak and air pollution forecasting. A specific methodology adapted to the data has been proposed. The method is based on regression estimation of the ozone concentration for a given day. Then for the classification problem (deciding whether that day is polluted or not), the regression value is stacked. Model selection p...
متن کاملCreep Life Forecasting of Weldment
One of the yet unresolved engineering problems is forecasting the creep lives of weldment in a pragmatic way with sufficient accuracy. There are number of obstacles to circumvent including: complex material behavior, lack of accurate knowledge about the creep material behavior specially about the heat affected zones (HAZ),accurate and multi-axial creep damage models, etc. In general, creep life...
متن کامل